Duration High-Order Hidden Markov Models and Training Algorithms for Speech Recognition
نویسنده
چکیده
The duration high-order hidden Markov model (DHO-HMM) can capture the dynamic evolution of a physical system more precisely than can the first-order hidden Markov model (HMM). The relations among the DHO-HMM, high-order HMM (HOHMM), hidden semi-Markov model (HSMM), and HMM are presented and discussed. Recursive forward and backward probability functions for the partial observation sequence were derived, and were used to calculate the expected number of state transitions and to update the DHO-HMM’s parameters. Viterbi decoding and training algorithms for the DHO-HMM are also presented. Experimental results show that the proposed expectation-maximization (EM) training algorithm can obtain more reliable and accurate estimate of DHO-HMMs than the Viterbi training method. Experimental results also show that the DHO-HMM speech recognizer is superior to the HSMM and the baseline conventional HMM recognizers. In experiments, the DHO-HMM speech recognizer trained by the EM algorithm reduces recognition errors by up to 53% compared with the baseline HMM.
منابع مشابه
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملA Study of Duration High-Order Hidden Markov Models and Training Algorithms for Speech Recognition
The duration high-order hidden Markov model (DHO-HMM) can capture the dynamic evolution of a physical system more precisely than the first-order hidden Markov model (HMM). The relationship among DHO-HMM, high-order HMM (HO-HMM), hidden semi-Markov model (HSMM), and HMM is presented and discussed. We derived recursive forward and backward probability functions for the partial observation sequenc...
متن کاملSpeaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words
متن کامل
Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words
متن کامل
شبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Inf. Sci. Eng.
دوره 31 شماره
صفحات -
تاریخ انتشار 2015